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1 Fast detection of communication patterns in distributed executions | 

Thomas Kunz, Michiel F, H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced 
Studies on Coiiabprative research 

Publisher: IBM Press 

Full text available: ^ pdf(4.21 MB) Additional Information: full citation , abstract , references , index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based 
on process-time diagrams are often used to obtain a better understanding of the execution 
of the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not 
provide the user with the desired overview of the application. In our experience, such tools 
display repeated occurrences of non-trivial commun ... 



2 GPGPU: general purpose computation on graphics hardware 

David Luebke, Mark Harris, Jens Kruger, Tim Purcell, Naga Govindaraju, Ian Buck, Cliff 
Woolley, Aaron Lefohn 

August 2004 Proceedings of the conference on SIGGRAPH 2004 course notes 
SIGGRAPH '04 

Publisher: ACM Press 

Full text available: ^ pdf(63.03 MB) Additional Information: full citation , abstract 

The graphics processor (GPU) on today's commodity video cards has evolved into an 
extremely powerful and flexible processor. The latest graphics architectures provide 
tremendous memory bandwidth and computational horsepower, with fully programmable 
vertex and pixel processing units that support vector operations up to full IEEE floating 
point precision. High level languages have emerged for graphics hardware, making this 
computational power accessible. Architecturally, GPUs are highly parallel s ... 



3 Multidimensional access methods 
Volker Gaede, Oliver Gunther 

June 1998 ACM Computing Surveys (CSUR), volume 30 issue 2 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: ^pdfn.OSMB) 



Search operations in databases require special support at the physical level. This is true 
for conventional databases as well as spatial databases, where typical search operations 
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include the point query (find all objects that contain a given search point) and the region 
query (find all objects that overlap a given search region). More than ten years of spatial 
database research have resulted in a great variety of multidimensional access methods to 
support ... 

Keywords: data structures, multidimensional access methods 



4 



Approximate query processing using wavelets 

Kaushik Chakrabarti, Minos Garofalakis, Rajeev Rastogi, Kyuseok Shim 

September 2001 The VLDB Journal — The International Journal on Very Large Data 

Bases, volume 10 Issue 2-3 
Publisher: Springer-Verlag New York, Inc. 

Full text available: ^ pdf(390.24 KB) Additional Information: full citation , abstract, citings , index terms 

Approximate query processing has emerged as a cost-effective approacli for dealing with 
the huge data volumes and stringent response-time requirements of today's decision 
support systems (DSS). Most work in this area, however, has so far been limited in its 
query processing scope, typically focusing on specific forms of aggregate queries. 
Furthermore, conventional approaches based on sampling or histograms appear to be 
inherently limited when it comes to approximating the results of complex queri ... 

Keywords: Approximate query answers. Data synopses. Query processing. Wavelet 
decomposition 



* External memory algorithms and data structures: dealing \N\th massive data 
Jeffrey Scott Vltter 

June 2001 ACM Computing Surveys (CSUR), volume 33 issue 2 
Publisher: ACM Press 

F II text available- 1^ Ddf(828 46 KB) Additional Information: full citation , abstract , references , citings , index 
" ^ ' = ^ terms 

Data sets in large applications are often too massive to fit completely inside the computers 
internal memory. The resulting input/output communication (or I/O) between fast internal 
memory and slower external memor/ (such as disks) can be a major performance 
bottleneck. In this article we survey the state of the art in the design and analysis of 
external memory (or EM) algorithms and data structures, where the goal is to exploit 
locality in order to reduce the I/O costs. We consider a varie ... 

Keywords: B-tree, I/O, batched, block, disk, dynamic, extendible hashing, external 
memory, hierarchical memory, multidimensional access methods, multilevel memory, 
online, out-of-core, secondary storage, sorting 



Rule-based spatiotemporal q uer y processing for video databases 
Mehmet Emin Donderler, Ozgiir Ulusoy, Ugur Giidukbay 

January 2004 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 13 Issue 1 
Publisher: Springer-Verlag New York, Inc. 

Full text available: ^pdf(231.09 KB) Additional Information: full citation , abstract , index terms 

In our earlier work, we proposed an architecture for a Web-based video database 
management system (VDBMS) providing an integrated support for spatiotemporal and 
semantic queries. In this paper, we focus on the task of spatiotemporal query processing 
and also propose an SQL-like video query language that has the capability to handle a 
broad range of spatiotemporal queries. The language is rule-based in that it allows users 
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to express spatial conditions in terms of Prolog-type predicates. Spatiote ... 

Keywords: Content-based retrieval, Inference rules, Multimedia databases, 
Spatlotemporal query processing, Video databases 



7 A case for fractured mirrors 

Ravlshankar Ramamurthy, David J. DeWitt, Qi Su 

August 2003 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 12 Issue 2 
Publisher: Springer-Verlag New York, Inc. 

Full text available: ^ pdf(200.49 KB) Additional Information: full citation , abstract , index terms 

Abstract.The decomposition storage model (DSM) vertically partitions all attributes of a 
table and has excellent I/O behavior when the number of attributes accessed by a query is 
small. It also has a better cache footprint than the standard storage model (NSI^) used by 
most database systems. However, DSM Incurs a high cost in reconstructing the original 
tuple from its partitions. We first revisit some of the performance problems associated with 
DSM and suggest a simple indexing strategy and compa ... 

Keywords: Data placement. Disk mirroring, Vertical partitioning 




Recent technical reports 

December 1979 ACM SIGACT News, Volume lo issue 3 
Publisher: ACM Press 

Full text available: ^ pdf(1.99 MB) Additional Information: full citation 



A qraph-theoretic model for optimizing queries involving methods 
Chiang Lee, Chi-Sheng Shih, Yaw-Huei Chen 

April 2001 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 9 Issue 4 
Publisher: Springer-Verlag New York, Inc. 

Full text available: ^ pdf(266.86 KB) Additional Information: full citation , abstract , index terms 

Traditional algorithms for optimizing the execution order of joins are no more valid when 
selections and projections Involve methods and become very expensive operations. 
Selections and projections could be even more costly than joins such that they are pulled 
above joins, rather than pushed down In a query tree. In this paper, we take a 
fundamental look at how to approach query optimization from a top-down design 
perspective, rather than trying to force one model to fit into another. We present ... 

Keywords: Graph model. Method query. Object-oriented databases, Query optimization. 
Spanning tree 



Implementing declarative overlays | 
Boon Thau Loo, Tyson Condie, Joseph M. Hellerstein, Petros Maniatis, Timothy Roscoe, Ion 
Stoica 

October 2005 ACM SIGOPS Operating Systems Review , Proceedings of tlie twentieth 
ACM symposium on Operating systems principles SOSP '05, volume 39 issue 
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Publisher: ACM Press 

Full text available: ^ pdf(370.85 KB) Additional Information: full citation , abstract , references , index terms 
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Overlay networks are used today in a variety of distributed systems ranging from file- 
sharing and storage systems to communication infrastructures. However, designing, 
building and adapting these overlays to the intended application and the target 
environment is a difficult and time consuming process.To ease the development and the 
deployment of such overlay networks we have implemented P2, a system that uses a 
declarative logic language to express overlay networks In a highly compact and reusab ... 

Keywords: dataflow engines, declarative overlays, executable pseudocode 



1'' On randomization in sequential and distributed algorithms | 
Rajiv Gupta, Scott A. Smolka, Shaji Bhaskar 
March 1994 ACM Computing Surveys (CSUR), Volume 26 issue i 

Publisher: ACM Press 

Full text available- j ggl Ddf(8.01 MB) Additional Information: full citation , abstract , references, citings, index 
. la^^ * • terms 

Probabilistic, or randomized, algorithms are fast becoming as commonplace as 
conventional deterministic algorithms. This survey presents five techniques that have been 
widely used in the design of randomized algorithms. These techniques are Illustrated using 
12 randomized algorithms— both sequential and distributed— that span a wide range of 
applications, including :prlmality testing (a classical problem in number theory), interactive 
probabilistic proof s ... 

Keywords: Byzantine agreement, CSP, analysis of algorithms, computational complexity, 
dining philosophers problem, distributed algorithms, graph isomorphism, hashing, 
interactive probabilistic proof systems, leader election, message routing, nearest- 
neighbors problem, perfect hashing, primality testing, probabilistic techniques, randomized 
or probabilistic algorithms, randomized quicksort, sequential algorithms, transitive 
tournaments, universal hashing 



The Quadtree and Related Hierarchical Data Structures 
Hanan Samet 

June 1984 ACM Computing Surveys (CSUR), Volume 16 issue 2 
Publisher: ACM Press 

Full text available: ^pdf(4.87 MB) Additional Information: full citation , references , citinos. index terms 



''S The elements of nature: interactive and realistic techniques 

Oliver Deusen, David S. Ebert, Ron Fedklw, F. Kenton Musgrave, Przemyslaw Prusinkiewicz, 
Doug Roble, Jos Stam, Jerry Tessendorf 

August 2004 Proceedings of the conference on SIGGRAPH 2004 course notes 
SIGGRAPH '04 

Publisher: ACM Press 

Full text available: ^pdfn7.65 MB) Additional Information: full citation , abstract 

This updated course on simulating natural phenomena will cover the latest research and 
production techniques for simulating most of the elements of nature. The presenters will 
provide movie production, Interactive simulation, and research perspectives on the difficult 
task of photorealistic modeling, rendering, and animation of natural phenomena. The 
course offers a nice balance of the latest interactive graphics hardware-based simulation 
techniques and the latest physics-based simulation techni ... 
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❖ Marc Olano, Kurt Akeley, John C. Hart, Wolfgang Heidrich, Michael McCool, Jason L Mitchell, 
Rand! Rost 

August 2004 Proceedings of the conference on SIGGRAPH 2004 course notes 
SIGGRAPH '04 

Publisher: ACM Press 

Full text available: ^ pdf(7.39 MB) Additional Information: full citation , abstract 

Real-time procedural shading was once seen as a distant dream. When the first version of 
this course was offered four years ago, real-time shading was possible, but only with one- 
of-a-kihd hardware or by combining the effects of tens to hundreds of rendering passes. 
Today, almost every new computer comes with graphics hardware capable of interactively 
executing shaders of thousands to tens of thousands of Instructions. This course has been 
redesigned to address today's real-time shading capabili ... 

15 The Grid File: An Adaptable. Symmetric Multikey File Structure Q 

J. Nievergelt, Hans Hinterberger, Kenneth C. Sevcik 
^ March 1984 ACM Transactions on Database Systems (TODS), volume 9 issue i 

Publisher: ACM Press 

Full text available: ^ odfrzaSMB) Additional Information: Mition, abstrad. references , dtirifls. index 

Traditional file structures that provide multikey access to records, for example, inverted 
files, are extensions of file structures originally designed for single-key access. They 
manifest various deficiencies in particular for multikey access to highly dynamic files. We 
study the dynamic aspects of file structures that treat all keys symmetrically, that is, file 
structures which avoid the distinction between primary and secondary keys. We start from 
a bitmap approach and treat the problem ... 



Special issue: A! in engineering 
D. Sriram, R. Joobbani 
April 1985 ACM SIGART Bulletin, issue 92 
Publisher: ACM Press 

Full text available:^ pdf(8.79 MB) Additional Information: full citation , abstract 

The papers in this special issue were compiled from responses to the announcement in the 
July 1984 issue of the SIGART newsletter and notices posted over the ARPAnet. The 
interest being shown In this area is reflected in the sixty papers received from over six 
countries. About half the papers were received over the computer network. 




Data clustering: a review 
A. K. Jain, M. N. Murty, P. J. Flynn 

September 1999 ACM Computing Surveys (CSUR), volume 3i issue 3 
Publisher: ACM Press 

Full text available: « pdf(636.24 KB) Additional Information: full citation, abstract, references , dfings. index 
^ terms , review 

Clustering is the unsupervised classification of patterns (observations, data items, or 
feature vectors) into groups (clusters). The clustering problem has been addressed in 
many contexts and by researchers in many disciplines; this reflects its broad appeal and 
usefulness as one of the steps in exploratory data analysis. However, clustering is a 
difficult problem combinatorially, and differences in assumptions and contexts In different 
communities has made the transfer of useful generic co ... 

Keywords: cluster analysis, clustering applications, exploratory data analysis, incremental 
clustering, similarity indices, unsupervised learning 



http://portal.acm.org/results.cfm?CFID=314664&CFTOKEN==42545794&^^ 9/8/2006 



Results (page 1): +query, +decompose, +span, +size, Grange, +partition, ^populate, +rand... Page 6 of 7 



18 Practical methods for constructing suffix trees 
Yuanyuan Tian, Sandeep Tata, A. Hankins, M. Patel 

September 2005 The VLDB Journal — The International Journal on Very Large Data 

Bases, volume 14 Issue 3 
Publisher: Springer-Verlag New York, Inc. 

Full text available: ^pdf(1.30 MB) Additional Information: full citation , abstract 

Sequence datasets are ubiquitous in modern life-science applications, and querying 
sequences is a common and critical operation In many of these applications. The suffix 
tree Is a versatile data structure that can be used to evaluate a wide variety of queries on 
sequence datasets, including evaluating exact and approximate string matches, and 
finding repeat patterns. However, methods for constructing suffix trees are often very 
time-consuming, especially for suffix trees that are large and do no ... 

Keywords: Suffix tree construction, sequence matching 



19 A qualitative comparison study of data structures for large line segment databases Q 
Erik G. Hoel, Hanan Samet 

June 1992 ACM SIGMOD Record , Proceedings of the 1992 ACM SIGMOD international 

conference on Management of data SIGMOD '92, volume 21 issue 2 
Publisher: ACM Press 

Full text available* HI Ddffl 42 MB) Additional Information: full citation, abstract, references, citings, index 
' ^ terms 

A qualitative comparative study is performed of the performance of three popular spatial 
indexing methods - the R-tree, R+-tree, and the PMR quadtree-in the context of 
processing spatial queries In large line segment databases. The data is drawn from the 
TIGER/Line files used by the Bureau of the Census to deal with the road networks in the 
US. The goal is not to find the best data structure as this Is not generally possible. 
Instead, their comparability is demonstrated ... 

Algorithmic issues in modeling motion Q 
Pankaj K. Agarwal, Leonidas 3. Guibas, Herbert Edelsbrunner, Jeff Erickson, Michael Isard, 
Sariel Har-Peled, John Hershberger, Christian Jensen, Lydia Kavraki, Patrice Koehl, Ming Lin, 
DInesh Manocha, DImltris Metaxas, Brian MIrtlch, David Mount, S. Muthukrishnan, DInesh 
Pal, Elisha Sacks, Jack Snoeyink, Subhash Suri, Ouri Wolefson 
December 2002 ACM Computing Surveys (CSUR), Volume 34 Issue 4 
Publisher: ACM Press 

Full text available: g pdf(205,25 KB) Additional Infonrnation: full citation, abstract, references, citings, ind^. 

: terms 

This article is a survey of research areas in which motion plays a pivotal role. The aim of 
the article Is to review current approaches to modeling motion together with related data 
structures and algorithms, and to summarize the challenges that lie ahead in producing a 
more unified theory of motion representation that would be useful across several 
disciplines. 

Keywords: Computational geometry, computer vision, mobile networks, modeling, 
molecular biology, motion modeling, physical simulation, robotoics, spatio-temporal 
databases 
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